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Abstract 



o 

£ 

We consider the rate of piecewise constant approximation to a locally sta 
| tionary process X(t),t £ [0, 1], having a variable smoothness index a(t). As 

suming that a(-) attains its unique minimum at zero and satisfies 

Oh. a(t) = a + btr i + o(i 7 ) asi^O, 

we propose a method for construction of observation points (composite dilated 
design) such that the integrated mean square error 

i K 

E{(X(t) - XJt)) 2 \dt — — j- asn^oo. 

o u w ny " 5 n a o(logn)( Q o+i)/7 

ly-j ■ where a piecewise constant approximation X n is based on N(n) ~ n observa- 

tions of X. Further, we prove that the suggested approximation rate is optimal, 
and then show how to find an optimal constant K . 

o 

£N) ■ Keywords: locally stationary processes, multifractional Brownian motion, piece- 

wise constant approximation. 

> : 

X- 

1 Introduction 

■ 

Probabilistic models based on the locally stationary processes with variable smooth- 
ness became recently an object of interest and a convenient tool for applications in 
various areas (such as modelling of internet traffic or artificial landscapes, finance, 
geophysics, biomedicine, etc) due to their flexibility. The most known representative 
random process of this class is a multifractional Brownian motion (mBm) indepen- 
dently introduced in [B] and [22] . A more general class of a(t)-locally stationary 
Gaussian processes with a variable smoothness index a(t),t € [0, 1], was elaborated 
in [TS] . We refer to [5] for a survey and to [H El E3 EZ] for studies of particular 
aspects of mBm. 

Whenever we need to model such processes with a given accuracy, the approxi- 
mation (time discretization) accuracy has to be evaluated. 

More specifically, consider a random process X(t), t G [0,1], with finite sec- 
ond moment and variable quadratic mean smoothness (see precise definition ([I]) 
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below). The process X is observed at N = N{n) points and a piecewise constant 
approximation X n is built upon these observations. The approximation perfor- 
mance on the entire interval is measured by integrated mean square error (IMSE) 
j , E{(X(i) - X n {t)) 2 }dt. We construct a sequence of sampling designs (i.e., sets 
of observation points) taking into account the varying smoothness of X such that 
on a class of processes, the IMSE decreases faster when compared to conventional 
regular sampling designs (see, e.g., [2B]) or to quasi-regular designs, [2j, used for 
approximation of locally stationary random processes and random processes with 
an isolated singularity point, respectively. 

The approximation results obtained in this paper can be used in various prob- 
lems in signal processing, e.g., in optimization of compressing digitized signals, (see, 
e-g-, |12|). in numerical analysis of random functions (see, e.g., [II Q3J Q3] ) , in sim- 
ulation studies with controlled accuracy for functionals on realizations of random 
processes (see, e.g., [HCE]). ^ * s known that piecewise constant approximation gives 
an optimal rate for certain class of continuous random processes satisfying Holder 
condition (see, e.g., 126] ). In this paper we develop a technique improving this 
rate for a certain class of locally stationary processes with variable smoothness. The 
developed technique can be generalized for more advanced approximation methods 
(e.g., Hermite splines) and various classes of random processes and fields. 

Some related approximation results for continuous and smooth random functions 
can be found in [TSJ Q2J ES] . The book [23] contains a very detailed survey of various 
random function approximation problems. 

The paper is organized as follows. In Section [JJ we specify the problem setting. 
We recall a notion of locally stationary process, introduce a class of piecewise con- 
stant approximation processes, and define integrated mean square error (IMSE) as a 
measure of approximation accuracy. Furthermore, we introduce a special method of 
composite dilated sampling designs that suggests how to distribute the observation 
points sufficiently densely located near the point of the lowest smoothness. The 
implementation of this design depends on some functional and numerical parame- 
ters, and we set up a certain number of mild assumptions about these parameters. 
In Section [3J our main results are stated. Namely, for a locally stationary process 
with known smoothness, we consider the piecewise constant interpolation related to 
dilated sampling designs (adjusted to smoothness parameters) and find the asymp- 
totic behavior of its approximation error. In the second part of that section, the 
approximation for conventional regular and some quasi-regular sampling designs are 
studied. In Section HJ the results and conjectures related to optimality of our bounds 
are discussed. Section [5] contains the proofs of the statements from Section [3J 



2 



2 Variable smoothness random processes and approxi- 
mation methods. Basic notation 



2.1 Approximation problem setting 

Let X = X(t),t E [0, 1], be an a(-)-locally stationary random process, i.e., K{X(t) 2 } < 
oo and 

Jim \\ X ( t + s )- X ( t )\\ 2 = c{t) uniform i y in t € [o, i] , (i) 

where := (EF 2 ) 1 / 2 , a(-),c(-) € C([0,1]) and 2 > a(t) > 0, c(t) > 0. 

We assume that the following conditions hold for the function a(-) describing 
smoothness of X: 

(CI) a(-) attains its global minimum ao := a(0) at the unique point to = 0. 
(C2) there exist b, 7 > such that 

a(t) = a + W + o(t 7 ) as t -> 0. 

The choice to = in (CI) is of course just a matter of notation convenience. The 
results are essentially the same for any location of the unique minimum of a(-). 

Let X be sampled at the distinct design points T n = (to(n), . . . , ijv(^)) (also 
referred as knots), where = to(n) < t\(n) < ••• < t^(n) = 1, N = N(n). 
We suppress the auxiliary integer argument n for design points t« = tj{n) and for 
number of points N = N{n) when doing so causes no confusion. 

The corresponding piecewise constant approximation is defined by 

X n {t) := X^), tj-x < t < tj, j = 1, . . . , N. 

In this article, we consider the accuracy of approximation to X by X n with respect 
to the integrated mean square error (IMSE) 

el = \\X-X n \\ 2 2 := f \\X(t)-X n (t 
Jo 



We describe now a construction of sampling designs {T n } providing the fastest decay 
of el 



a ■ 



2.2 Sampling design construction 

The construction idea is as follows. In order to achieve a rate-optimal approximation 
of X by X n , we introduce a sequence of dilated sampling designs {T n }. 

Recall first that any probability density f(t),t £ [0,1], generates a sequence of 
associated conventional sampling designs, (cf., e.g., [23], [7], [26]) defined by 

f{t)dt= J -, i = 0,...,n, (2) 
n 

i.e., the corresponding sampling points are (j/n)-percentiles of the density /(•), say, 
a sampling density. 
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Let p(-) be a probability density on M + := [0, oo). In our problem, it turns out to 
be useful to dilate the design density by replacing it with a dilated sampling density 



Pn (t) :=d n p{d n t), te[0,l], (3) 

where d n /* oo is a dilation coefficient. Note, that formally p n (-) is not a probability 
density, but 



1 

p n (t)dt = I p{u)du —7- 1 as n — > oo. 



'o 

The idea of dilation is obvious: we wish to put more knots near the point of the 
worst smoothness. The delation coefficient should be chosen accordingly to the 
smoothness behavior at this critical point. In our case, (C2) requires the choice 

d n : = (logn) 1 ^ 

that will be maintained in the sequel. As in ([2]), we define the knots by 

p n (t)dt = J -. (4) 
n 

Further optimization of the approximation accuracy bound requires one more ad- 
justment: it turns out to be useful to choose the knots tj as in (jl]) using different 
densities in a neighborhood of the critical point and outside of it. We call composite 
such sampling design constructions operating differently on two disjoint domains. 

Now we pass to the rigorous description of our sampling designs. Let p(u) and 
p(u),u € [0, oo), be two probability densities. Let the dilated densities p n {-) be 
defined as in ([3]). Similarly, p n {t) := d n p(d n t). 

For < p < 1, we define the composite dilated (p, p,p)- designs T n by choosing 
tj according to (jl|) for 

rp rpdn 
< j < J(p,p,n) := n p n (t)dt = n p(u)du < n. 



o 



Notice that for these knots, we have < t j < p. Furthermore, we fill the interval 
(p, 1] with analogous knots t{ using density p(-), 

U • 

p n (t)dt =-, (5) 

n 

where 

J(p,p,n) < j < J{p, l,n), 
i = j + J{p,P,n) - J(p,p,n). 
For these knots we clearly have p < ti < 1. Note, it follows by definition that 

rpdn 

J(p,p,n)r^n / p(u)du ~ n as n — > oo, 
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and similarly, in the interval [p, 1], the number of points does not exceed 

/•oo 

n — J(p, p, n) ~ n I p(u)du = o(n) as n — > oo, 

J pd n 

that is the total number of sampling points satisfies 

N(n) ~ J(p, p, n) ~ n as n — > oo. (6) 

In the sequel, we will use (p, p,p)-designs satisfying the following additional 
assumptions on p(-), p, and p(-): 

(Al) The design density p{-) is bounded, non-increasing, and 

p(u) > qi exp{-q 2 u' f }, u > 0, qi > 0, ^ > q 2 > 0. (7) 

(A2) We assume that p is regularly varying at infinity with some index r < — 1. 
This means that for all A > 0, 

1, / ->• A r as it ->• +oo. (8) 
p{u) 

In this case, we write p(-) G 7^ r (+oo). 

(A3) Finally, we assume that the parameter p is small enough. Namely, applying 
(/2 < b/ao and using (C2) we may choose p satisfying 

. . „ ait) — q(0) , , 

q 2 sup a(t) < mf W ^ (9) 

o<t<p o<t< P r< 

and 

< I- (10) 

For example, let a(t) = 1 + V . Then (CI), (C2) hold and (A3) corresponds to 
p < (l/q 2 - 1) 1/7 , where < q 2 < 1. 

Regularly varying densities satisfy (|7|) for large u, thus we could simplify the 
design construction by letting p = p. However, this simplified choice does not 
provide an optimal constant K in the main approximation error asymptotics ([lip 
below. 



3 Main results 

3.1 Dilated approximation designs 

In the following theorem, we give the principal result of the paper and consider 
IMSE of approximation to X by X n for the proposed sequence of composite 
dilated sampling designs T n ,n > 1. It follows from (Al) that the following constant 
is finite, 

c f'°° 

K = K(c,a,(p,p,p)):=^— / p(u)- a ° e~ bul du < oo, 

«o + 1 Jo 

where cq := c(0). 
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Theorem 1 Let X(t),t € [0,1], be an a(-) -locally stationary random process such 
that assumptions (CI), (C2) hold. Let X n be the piecewise constant approximations 
corresponding to composite dilated (p, p,p)- designs {T n } satisfying (Al)-(A3). Then 
N(n) ~ n and 

K K 

\\X — X n \\o ~ ; T7 — tttt- ~ ; 77 — rm o,s n —¥ oo. (11) 

n a o(logn)( Qo+1 )/T iV Q o(logiV)(ao+i)/7 v ; 

Remark 1. Among the assumptions of Theorem[Tl the monotonicity of p(-) is worth 
of a discussion. Of course, it agrees with the heuristics to put more knots at places 
where the smoothness of the process is worse. However, this assumption may be 
easily replaced by some mild regularity assumptions on p(-). 

Remark 2. The following density p*(-) 

p*(u) = Ce~ buJ/( - ao+l) , C = x1 f 7 . 

K ' (a + 1)^(1/7 + 1) 

minimizes the constant K in Theorem [T] and generates the asymptotically optimal 
sequence of designs T*. For the optimal T*, 

K * := ( r e - b uV( ao+ i) du \ ao+1 _ ^ ( («o + i)^r(i/ 7 + 1) x Qo+1 



ao + 1 \Jo J ao + 1 \ 6 1/7 J 

see, e.g., [26]. We stress that p*(-) satisfies assumption ([7]) but it is not regularly 
varying. In other words, a simple design based on p = p = p* does not fit in 
theorem's assumptions. 

Remark 3. The idea of considering composite designs might seem to be overcom- 
plicated from the first glance. However, in some sense it can not be avoided. The 
previous remark shows that if we want to get the optimal constant K, we must 
handle the exponentially decreasing densities. Assume that 

p(u) < qiexp{-q 2 u'}. (12) 

If we would simplify the design by defining tj(n) as in for the entire interval, 
i.e., with p = 1, then we would have 



p n {t)dt = -, 
n 



hence, 



/ p n {t)dt = / d n p(d n t)dt < d n qi / eKp{-q 2 {d n t) 1 }dt 



n 

< d n qi(t j+l - tj) ex.p{-q 2 (d n t j ) 1 }. 



Let a 6 (0, 1) and tj € [1 — a, 1]. Then for the length of corresponding intervals, 
we have 

. . . exp{q 2 (d n tjy} exp{g 2 logre(l - o) 7 } 

tj+l ~ tj > ; > 



nd n qi nd n qi 
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If 52 > 1 and a is so small that (72(1 — «) 7 > lj we readily obtain tj + i — tj > a for 
large n which is impossible. Therefore, for q<i > 1 there are no sampling points tj in 
[1 — a, 1], i.e., clearly > C > for any n, i.e., IMSE does not tend to zero at all. 

The confusion described above may really appear in practice because q 2 > 1 is 
compatible with assumption Q2 < from ([TJ whenever b > a>Q. 

Theorem Q] shows that for the design densities with regularly varying tails, we 
may define all knots by without leaving empty intervals as above. However, we 
can not achieve the optimal constant K on this simpler way. 

Remark 4. Actually, the choice of knots outside of [0, p] is not relevant for ap- 
proximation rate. One can replace the knots from ([5]) with a uniform grid of knots 
ti = in - ^ with appropriate // < 1. 

3.2 Regular sampling designs 

The approximation algorithm investigated in Theorem Q] is based upon the assump- 
tion that we know the point where a(-) attains its minimum, as well as the index 
7 in (C2). If for the same process neither critical point nor index 7 are known, a 
conventional regular design can be used. 

Let a random process X(t),t € [0, 1], be an a(-)-locally stationary, i.e., (pQ) hold. 
Consider now sampling designs T n = {tj(n), j = 0, 1, . . . , n} generated by a regular 
positive continuous density p(t), t € [0,1], (see, e.g., [23], [26]) through ([13]) . i.e., 

rU j 

p(t)dt = - , < j < n. (13) 

n 



Let the constant 



cq r(i/ 7 + i) 



Theorem 2 Let X(t),t € [0,1], be an a(-) -locally stationary random process and 
(CI), (C2) hold. Let X n be the piecewise constant approximations corresponding to 
(regular) sampling designs {T n } generated byp(-). Then 

,,2 Ki 
\\X — X n \\ 2 ~ ttt— as n — > 00. 



n ao (logn) 1 /'y 



Remark 5. If the point where a(-) attains its minimum, is known but 7 is unknown, 
we may build the designs without dilating the density. Instead, one could use quasi- 
regular sampling designs generated by a possibly unbounded density p(t),t € (0, 1], 
at the singularity point to = (cf., [2]). For example, if p(-) is a density on (0, 1] 
such that 

p(t) ~ A t~~ K as t \ 0, < k < 1, 

and f j (n) are chosen through f)13|) , then for a locally stationary process X satisfying 
(CI) and (C2), it is possible to show a slightly weaker asymptotics than that of 
Theorem [1] 

2 K 2 
e n ~ — r-j- as n -4 00, 

n Q0(l ogn )(l+Ka )/7 
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with K 2 := coA- Q °r(l/ 7 + l)/((a + 1)6 1/7 ). 

Of course, all above mentioned asymptotics differ only by a degree of logarithm 
while the polynomial rate is determined by the minimal regularity index ao- 



4 Optimality 

4.1 Optimality of the rate for piecewise constant approximations 

We explain here that the approximation rate Z" 1 = n~ a °d n ( - ao+1 ^ achieved in Theo- 
rem Q] is optimal in the class of piecewise constant approximations for every locally 
stationary process satisfying (CI) and (C2). For a sampling design T n , let the mesh 
size \T n \ := max{(tj — tj—i),j = 1, . . . , n}. 

Proposition 1 Let X n be piecewise constant approximations to a locally stationary 
process X satisfying (CI) and (C2) constructed according to designs {T n } such that 
N n ~ n and \T n \ — » as n — > oo. Then 

liminf l n e 2 n > 0. (14) 



n— >oo 



4.2 Optimality of the rate in a class of linear methods 

We explain here that the approximation rate Z" 1 achieved in Theorem [1] is optimal 
not only in the class of piecewise constant approximations but in a much wider class 
of linear methods, - at least for some locally stationary processes satisfying (CI) and 
(C2). The corresponding setting is based on the notion of Gaussian approximation 
numbers, or ^-numbers, that we recall here. 

Gaussian approximation numbers of a Gaussian random vector X taking values 
in a normed space X are defined by 

2 



l n (X-Xy= inf E 

?1 ,--,(n-l 



71-1 

X ~ Yl ^i x i 

3=1 



(15) 



.v. 



where infimum is taken over all Xj G X and all Gaussian vectors £ = (£i , . . . , £ n _ 
R n_1 , see [201 [21]. If X is a Hilbert space, then 



l n (X;Xf = J^X 



j=n 

where Xj is a decreasing sequence of eigenvalues of covariance operator of X. 

Recall that a multifractional Brownian motion (mBm) with a variable smooth- 
ness index (or fractality function) a(-) € (0,2) introduced in [6j [22] and studied in 
[31 [H [5] is a Gaussian process defined through its white noise representation 

/oo itu i 

.„ H («(t)+D/2 
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where W(t),t € ffi, is a conventional Brownian motion. Notice that mBm is a typical 
example of a locally stationary process whenever a(-) is a continuous function. 

In particular case of the constant fractality a(t) = a, we obtain an ordinary 
fractional Brownian motion B a , a € (0,2). For X = B a considered as an element 
of X = Z/2[0, 1], the behavior of eigenvalues \j is well known, cf. [9]. Namely, 

Xj ~ c Q j _a_1 as j — > oo 

with some c a > continuously depending on ol G (0, 2). It follows that 

l n (B a ;L 2 [0,l]) 2 ~ a~ l c a n- a as n -> oo. 

Hence, for all n > 1, 

£ n (lT;L 2 [0,l]) 2 >C a n- Q . 

Furthermore, since -B a is a self-similar process, we can scale this estimate from 
X = Z/2[0, 1] to X = L2[0,r] with arbitrary r > 0. An easy computation shows that 

£ n (B a ;L 2 [0,r}) 2 = r a+1 e n (B a ;L 2 [0,l}) 2 > C a r a+1 n^. 

Let us now consider a multifractional Brownian motion X parameterized by a frac- 
tality function a(-) satisfying (C2). For example, let 

a (t) := a + < t < 1, (16) 

with ao, & > chosen so small that oto + b < 2. This choice secures the necessary 
condition 

< a(t) < 2, < t < 1. 
Then, letting r = r n := d~ l , we have 

4(^;£ 2 [0,1]) 2 > £„(X;L 2 [0 ) r n ]) 2 >M4(5 Q ( r «);L 2 [0 ) r ri ]) 2 

> C^W-^ = CC^^n)"^ 
= Cl-^dnn)- 6 ^")" 1 >Cl~\ 

for some positive M, C a r r \ } C, C. All bounds here are obvious except for the second 
inequality comparing approximation rate of multifractional Brownian motion with 
that of a conventional fractional Brownian motion. 
We state this fact as a separate result. 

Proposition 2 Let X(t),a < t < b, be a multifractional Brownian motion cor- 
responding to a continuous fractality function a : [a,b] —> (0,2). Let B 13 be a 
fractional Brownian motion such that inf a < t <b a(t) < (3 < 2. Then there exists 
M = M(a(-),/3) > such that 

£ n (X;L 2 [a,b\) > Ml n {B$ ,L 2 [a,b]), n > 1. 
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The proof of this proposition requires the methods very different from those used in 
this article. We relegate it to another publication. 

Our conclusion is that a multifractional Brownian motion with fractality function 
()16p provides an example of a locally stationary process satisfying assumptions (CI) 
and (C2) such that no linear approximation method provides a better approximation 
rate than 



5 Proofs 

Proof of Theorem Q]; 

We represent the IMSE e n = — X n (i)||| as the following sum 

e '=E / 1 \\X(t)-X n (t)\\ 2 dt = J2 ' \\X{t)-X{t^ x )\\ 2 dt=:Y,<r (") 
j=\ j=i Jtj-i j=\ 

Next, for a large U > 0, let 

N 

e2 n = Yl e n,j = Sl+S 2 + S3, 
3=1 

where the sums Si, 82,83 include the terms e n j such that [tj-i,tj] belongs to 
[0,U/d n ], [U/d n ,p], and [p, 1], respectively. Let J\ and J2 denote the corresponding 
boundaries for index j. Denote by 

l n := n a °d^ 0+l = n a °(logn)( ao+1 >/ 7 . 

Recall that I" 1 is the approximation rate announced in the theorem. We show that 
only Si is relevant to the asymptotics of e n , namely, that l n Ss = o(l) as n — > 00, 
while 

limsup l n S2 = o(l) as U — > 00. (18) 



n— Kx> 



Let Wj := tj — tj-i, Uj := d n tj be the normalized knots and denote by Vj 
Uj — Uj-i = d n Wj the corresponding dilated interval lengths. 

It follows by the definition of a(-)-local stationarity ([T]) that for large n, 



e 2 



t 

*3-l 



(*,•_!) / 3 (t-t^i) a ^dt(l + r n j) 



= B^i (tj - t J _ 1 ) a fe- 1 )+ 1 (l + r n j) 

= B^i (v j /d n ) a ^ +1 (l + r nj -), (19) 
where |T n | = maxj ioj = o(l) and max.,- r n j = o(l) as n — > 00 and 

J3.-:= ^ , j = l,...,N. 

First, we evaluate S3. Recall that for j > J2 we have pd n < uj-i < Uj < d n . 
We use now the following property of regularly varying functions (see, e.g., [TO]): 
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convergence in ([8]) is uniform for all intervals 0<a<A<6<oo. Using this 
uniformity we obtain, for some C\ > 0, 

inf p(u) > inf p(u) > C\p(d n ). 

Uj-i<u<Uj pd n <u<d n 

It follows by © that 

/ p(u)du = / p n {t)dt = - . 

Hence, for some C2 > 0, 

( \~ X P ~ 1 ( 

Vj < I inf p(u) I / p(u)du < — I inf p{u) 

^ 1 IL F\ u "n ) 1 1 

and maXj>j 2 tOj = <4l' /n. Recall that by assumption (CI), 

«l := inf a(i) > ao- 
te[ P ,i] 

Therefore, for large n, we get by (fT9)) and ([20]) . 6*3,(74 > 0, 

>IOi+i) 

5 3 < nnmxelj < nC 3 {vj/d n ) ai+1 < d^-^i — = o(/~ 1 ) as n -> 00. (21) 

Now consider the first two zones corresponding to Si, S^- We have by definition 

I ' p(u)du = i- , < j < J. 

Jo n 

Since the function p n (t),t € [0,1], is non-increasing, the sequence {vj} is non- 
decreasing. In fact, 



1 



p{u)du G \p(uj)vj,p(uj-i)vj], 



and therefore, 

np(uj^i) ~ i ~ np(uj) ~ J+1 ^ 
and it follows by (Al) that maxj<j 2 Wj = o(l) as n — > 00. For j < J2, the bounds 
(fT9j) and ([22} yield for n large, 



c 2 



= Bi-ife/4)*-^(l + o(l)) 

< B^iK^K))-^ 1 ' ^(1 + 0(1)) 



< B^inpiu^-^-^d-^v^l + o(l)) 

= 5 i _ 1 /- 1 n-( Q ^- l) - ao Vn i )" Q(tj - l) ^(l + o(l)). (23) 

From now on, we proceed differently in the first and in the second zone. 
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For the second zone, J\ < j < J 2 , we do not care about the constant by using 

Bj <B 1f : = max . (24) 

3 ~ o<t<i a(t) + 1 v ' 

Next, © and @ give 

p( Uj )- a ^ < Cexp{q 2 a(t J . 1 )u]} < Cexp{/3 lU ]}, C> 0, (25) 
where fi\ := g 2 sup 0<t<p a(t). On the other hand, we infer from © that 

_a(tj_l)-«0 t 7 u7_j 
n -(a(%_i)-oo) = n < „-fr Togn" = exp{-/3 2 ttj.j}, (26) 

where /3 2 := info<t< P (a(i) - ao)A 7 > A by Q. 

Recall that by JTOJ), we have 1 — Q2P^ ^ 0. Moreover, for £/ ^ ia^ ^ pd nj we 
derive from ([7]) and ([22]) 



«7 < n-VCprfn) -1 < Cn- 1 exp{q 2 (pd n )' r } = Cn^ 1 " 92 ^, C > 0, 



and it follows 



777 
n J+i " = u l-i 



{{^y - 1 )=° (^«" (1_92p7) ) = o(l) as n -> 00 (27) 



uniformly in J\ < j < J 2 . 

Since {vj} is non-decreasing, (I27p implies an integral bound 

exp{-/3 2 uj_ 1 }exp{y8iuj}u j - 
= exp{/3 2 - uJ_J} exp{/3iuj - /3 2 uJ +1 }vj 

< C inf exp{/3iii 7 — /3 2 u 7 }i;j_|_i 

< C e'^-^du, C>0. (28) 



By plugging (f24l) . (j26j) . and (f28l) into (|23|) . and summing up the resulting bounds 
over Ji < j < J 2 , we obtain 

S 2 < BJ' 1 / e^ 2 "^" 7 ^ (1 + o(l)) asn^oo. (29) 
Ju 



Therefore, (fl~8j) is valid. 

In the first zone, j < Ji, ij < U/d n , the knots are uniformly small. Hence, 
Bj-i are uniformly close to B due to the continuity of the functions a(-) and c(-). 
Moreover, by (C2) for any e > 0, we have for all n large enough 

a + (b- e)t]_ x < ait,-!) < a + (b + e)t]_ v j < Ji- (30) 

Hence (EHJ) yields 



.7 



(B + e)/^ 1 n -(b-e)(^-i/rf») 7 p(^)- a Op( Ui )-( Q fe-i)- a o) Vj . (31) 
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Recall that by the definition of d n , we have 

Since p(-) in non-increasing and {vj} is non-decreasing, we also have an integral 
bound 

exp{-(6 - ejuj.-j p(uj)~ ao Vj 
= exp{(6 - e)[uj +1 - uJ_J} exp{-(6 - e)uj +1 }«j 

< eW {(b-e)[n] +1 -u]^}} inf (pW^e-^^) Vj 

< exp{(b - e)[u] +1 - u]^]} p{u)- a °e-^ ul du. (32) 

JUj 

Moreover, for Uj < U, we derive from (Al) and (122 H 

< n -1 p(E/) -1 . 

By using convexity and concavity of the power function for 7 > 1 and 7 < 1, 
respectively, we get 

u] +l - u]_ t < 7J7 7 ~ 1 (n i+ i - uj) = 7J7 7_1 (^ + v j+1 ) 

< 2jU"'~ 1 Vj + i = o(l) as n -> 00 (7 > 1); 

- mJ_! < (n i+ i - Uj_i) 7 

= (vj + Vj + i) 7 = o(l) as n — > 00 (7 < 1). 

Therefore, the exponential factor in (j32|) turns out to be negligible. 
Finally, for Uj < U, the property d n — > 00 yields 

p^j-)-^^- 1 )-^) < max{l,p(f/)- max °<'<^( Q W- ao) } = l + o(l). (33) 

By plugging ([32]) and ([33]) into (f3~Tj) . and summing up the resulting bounds over 
j < Ji, we obtain 

POD 

S 1 <(B + 2e)l- 1 / p{u)- ao e- {b - e)ul du as ra -> 00. 
Jo 

Since e can be chosen arbitrarily small, we arrive at 

limsup l n S! <B / p(n)- Q0 e- bu7 (in = if. (34) 

n— >oo Jo 

Combining (j2Tjh (f29|h and (f34"|) gives the desired upper bound. 

The lower bound is obtained along the same lines: we neglect S2 and S3, and 
evaluate S\ starting again from f)19|) . As in ()23|) . we have 

<. = B^^./^-^^a+oa)) 

> B^iKM^-i))-" 1 ^- 1 ' ^-(1 + 0(1)) 

= ^^in-^n^^^- 1 )"" ^^^!)^^- 1 )^ 00 " 1 ^""^" ^^! + o(l)) 
= B i _ 1 /- 1 n-( Q ^- 1 )- Q °)p(n i _ 1 )- a fe- 1 )C~ Q(t ^ l) ^(l + o(l)). (35) 

13 



Recall that for j < J\, coefficients -Bj-i are uniformly close to B. Moreover, by 
using ([30]) . we have for large n, 



> d ~^U > d -(b+e)(U /dn r = 1 + o(1) . 

Hence, ([35]) yields 

e 2 njj > (B — eX^^U p(« i _ 1 )- a «'p(u i _ 1 )-(«(*i-i)- 01 <') Vj 

= (B — e)/" 1 n -M(«,--iW p( Uj . l )- ffi «p( 1 j j . l )-( (, ( t i-i)- (, «) u,., (36) 

where as before 

n -(6+e)(« J _ 1 /d n )'» = n -(6+ e )«7_ 1 /logn = exp{-(6 + ejtij.j. 

Since p(-) is non-increasing and {vj} is non-decreasing, we also have an integral 
bound 

exp{-(& + e)uJ_Jp(u j _ 1 )- a ° Uj 
= exp{(6 + e)[nj_ 2 - u]„ 1 ]}p(n i _i)- Q ° exp{-(6 + e)uj_ 2 }uj 



> exp{(b + e)[u]_ 2-uT.J} inf (p(u)- ao e-( b+£ ) u7 ) 

> e^{(6 + e)[«J +1 -uJ_J} P^p^-^e-^ 6 ^^. (37) 



We have already seen that the exponential factor in (J37J) is negligible. 
Finally, for iij < U, the property d n — > oo implies (cf., (f33l) 

p(u j _i)-W*''- 1 5 -a °) > min{l,p(0)- maxo <'<^ {Q{t) - ao) } = 1 + o(l). (38) 



By plugging (1371) and (|38j) into (j36j) . and summing up the resulting bounds over 
j < we obtain 



n — f oo. 



Si > {B - 2e)C x I p{u)- ao e-( h+e)ul du as 
Jo 

Since e can be chosen arbitrarily small, we arrive to 

rU 

liminf l n Si > B / p(u)- ao e- bul ' du. 

n->oo J Q 

Finally, 

/■oo 

liminf / n e^ > sup liminf l n Si >B p(u)- a ° e~ buJ du = K. (39) 

n->oo u>0 n-Hx> _/ 

This is the desired lower bound. □ 
Proof of Theorem 0.' 

Applying the notation of Theorem [H we have for an interval approximation error 
e l,j = B 3-i wf t3 ~ l)+l (1 + r n j), wj = tj - tj-i,j = 1, . . . ,n, 
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where max.,- r n j = o(l) as n — > oo. Now for a small enough p > 0, similarly to 
Theorem [H we get 

/ e n (t) 2 dt < C/n a \ C > 0, a x := inf a(t) > a , 

that is only e n j such that C [0, p) are relevant for the asymptotics, say, 

e n,j,j = 1,..., J= J(p,n). Let us denote the approximation rate L n := n Q °(logn) 1 / 7 . 
Next, for S\ := e ™,j an d small enough we have by continuity of the density 
p(-) and the mean value theorem 



e 2 n>j = B 5 - X (np^r^-^Wj (1 + o(l)) 
< — ^— (l + e)n- ao /'"'e-fHt^g^n+om) 

- p(o) q « v ; 4_ 2 v 

= L" 1 -^(l + e) T" 1 e-( fe -^ 7 ^ (1 + o(l)), 
where po ■= p(0). Now by summing up, we obtain 

r(i/ 7 + i) 



limsupL^! < — (1 + e) / e~^ ul du = — (1 + e) 

n^oo P Jo Po 



Hence, 



limsupL n e n = lim sup L n Si < (1 + e) 



n— >oo n— too Pq° (p ~ e) 1 ^ 

Since e can be chosen arbitrary small, we get 

2 B IYI/7 + I) 
hmsu P L n e n < — = K x . 

n— >oo Po * ' 

The lower bound follows similarly. This completes the proof. □ 
Proof of Proposition \Q 

Let r n := d~ l = (logn) -1 / 7 and J n := inf{j : tj = tj(n) > r n }. Then (fTUj) implies 

4 > £<; = ^B^f^^a +o(i)) = +o(i)), 
j=i j=i j=i 

where a n := sup 0<t<rn ot(t) and iOj = tj — tj-\. By using convexity of the power 
function w — > u> an+1 , we obtain 



1 Jn I -| Jn 



a n +l 

, 0» + l 



hence, 

Jn r an+l r a n +l 
On+l > £n > ^ 

J - 7 a ™ — /\T a " 

, iV n 
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whereas 

4 > B (l + o(l)) =B (l + (l)) 

J v n fln J v n 

- U) (1+0(1)) 

(i \ On— ao 

Recall that by (C2), a n - a = 0{rl) = O^logra)- 1 ) and thus CHD follows. □ 
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